On speech intelligibility estimation of phase-aware single-channel speech enhancement
Authors
Abstract
To reduce time and cost in the development of noise reduction algorithms, an objective intelligibility measure is crucial. Such a measure has to correlate highly with speech intelligibility as determined by real listening experiments. In the past, several measures were found to perform reliably in particular scenarios where only the spectral amplitude of a noisy signal is modified. Recent studies demonstrate the positive impact of phase modification in single-channel speech enhancement, showing improved speech intelligibility, whereas conventional methods relying on amplitude-only modification are known to yield reduced intelligibility. Further, another recent study shows that a distortion metric defined on the spectral phase outperforms state-of-the-art quality metrics when used in phase-aware speech enhancement. This raises two questions that we address in this work: first, how reliably the existing intelligibility measures predict the performance of phase-aware methods; and second, which candidate phase-aware instrumental metrics can be defined and how reliable they are for intelligibility prediction. Our objective and subjective evaluations demonstrate that the CSII-based measures and STOI, as well as the proposed phase-aware metrics, perform as reliable speech intelligibility estimators, in line with the subjective results.
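As an illustration of the correlation requirement stated in the abstract, the following minimal sketch computes the Pearson correlation between an objective metric and listening-test scores. The function name `pearson_correlation` and all numeric values are hypothetical placeholders chosen for illustration; they are not taken from the paper, and the paper may use a different correlation or mapping procedure.

```python
import math


def pearson_correlation(xs, ys):
    """Return the Pearson correlation coefficient of two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length with at least 2 items")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Unnormalized covariance and variances (the 1/n factors cancel in the ratio).
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_x * var_y)


# Hypothetical placeholder data, NOT results from the paper:
# one objective score and one subjective score per test condition.
objective_scores = [0.42, 0.55, 0.63, 0.71, 0.80]    # e.g. an instrumental metric
subjective_scores = [35.0, 48.0, 60.0, 72.0, 85.0]   # e.g. percent words correct

rho = pearson_correlation(objective_scores, subjective_scores)
print(f"Pearson correlation: {rho:.3f}")
```

A value of `rho` close to 1 would indicate that the objective metric ranks and scales the test conditions much as the listeners did, which is the property the abstract asks of a reliable intelligibility estimator.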
Related resources
Noise Estimation in Single Channel Speech Enhancement Using FFT
Conventional speech enhancement methods typically utilize the noisy phase spectrum for signal reconstruction. This letter presents a novel method to estimate the clean speech phase spectrum, given the noisy speech observation in single-channel speech enhancement. The proposed method relies on the phase decomposition of the instantaneous noisy phase spectrum followed by temporal smoothing in ord...
Iterative refinement of amplitude and phase in single-channel speech enhancement
While state-of-the-art speech enhancement methods focus on modifying the noisy spectral amplitude, our recent findings demonstrate the positive impact of incorporating the speech phase spectrum in speech enhancement. In this show and tell proposal, we demonstrate the recent progress towards utilizing the phase information in a closed-loop iterative manner leading to the joint enha...
Dual-Channel Speech Intelligibility Enhancement Based on the Psychoacoustics
In this paper, we propose an algorithm that enhances speech intelligibility using the properties of the human auditory system. In previous algorithms related to speech intelligibility, the improvement in intelligibility has mostly been addressed in a single-channel environment where the speech and noise signals are mixed together. But the speech enhancement problem of dual channel, in w...
Two-Stage Temporal Processing for Single-Channel Speech Enhancement
Most conventional speech enhancement methods operating in the spectral domain suffer from a spurious artifact called musical noise. Moreover, these methods also incur extra overhead time for noise power spectral density estimation. In this paper, a speech enhancement framework is proposed by cascading two temporal processing stages. The first stage performs excitation source based...
Robust Speech Recognition Using Speech Enhancement
Automatic Speech Recognition (ASR) has matured into a technology that is becoming more common in our everyday lives, and is emerging as a necessity to minimise driver distraction when operating in-car systems such as navigation and infotainment. In “noise-free” environments, word recognition performance of these systems has been shown to approach 100%; however, this performance degrades rapidly...